MORRIS: A Distributed File System for Read-Intensive Applications
نویسندگان
چکیده
This paper presents the design and implementation of Modularly Optimized Round-robin ReadIntensive Storage (MORRIS), a file system which provides high throughput for read-intensive applications. NFSStripe, MORRIS’ primary component, is an NFS loopback server that achieves performance competitive with the traditional single-server model by distributing the task of data storage and retrieval over multiple machines. There are two main challenges associated with such a design. The first consists of structuring the underlying storage of the filesystem in such a way as to take advantage of multiple data servers, thereby allowing multiple concurrent read operations to be efficiently executed. We solve this problem by “striping” files across multiple StripeServer data servers in fixed-size blocks. Employing multiple machines to serve data from disk allows our system to fulfill multiple client requests at once, whereas a singleserver arrangement cannot. The second challenge is to ensure filesystem coherence as multiple concurrent client operations are issued to multiple independent servers. We solve this problem, while conferring minimal impact on the system’s performance, by designing a multiple-reader/single-writer locking protocol specifically suited to our system’s data structures.
منابع مشابه
Don't Give Up on Distributed File Systems
Wide-area distributed applications often reinvent the wheel for their storage needs, each incorporating its own special-purpose storage manager to cope with distribution, intermittent failures, limited bandwidth, and high latencies. This paper argues that a distributed file system could provide a reusable solution to these problems by coupling a standard interface with a design suited to widear...
متن کاملROW-FS: A User-Level Virtualized Redirect-on-Write Distributed File System for Wide Area Applications
We propose a virtualization approach to implement redirect-on-write capabilities that overlay a traditional distributed file system. The redirect-onwrite distributed file system (ROW-FS) is implemented via a user-level proxy that is able to selectively steer Network File System (NFS) RPC calls to one of two servers: a “main” read-only server, and a “shadow” read-write server. By employing virtu...
متن کاملFlexible, Wide-Area Storage for Distributed Systems with WheelFS
WheelFS is a wide-area distributed storage system intended to help multi-site applications share data and gain fault tolerance. WheelFS takes the form of a distributed file system with a familiar POSIX interface. Its design allows applications to adjust the tradeoff between prompt visibility of updates from other sites and the ability for sites to operate independently despite failures and long...
متن کاملA Cooperative File System
The Cooperative File System (CFS) is a new peer-to-peer read-only storage system that provides provable guarantees for the efficiency, robustness, and load-balance of file storage and retrieval. CFS does this with a completely decentralized architecture that can scale to large systems. CFS servers provide a distributed hash table (DHash) for block storage. CFS clients interpret DHash blocks as ...
متن کاملFusionFS: a distributed file system for large scale data-intensive computing
Today’s science is generating datasets that are increasing exponentially in both complexity and volume, making their analysis, archival, and sharing one of the grand challenges of the 21st century. Exascale computing, i.e. 10 FLOPS, is predicted to emerge by 2019 with current trends. Millions of nodes and billions of threads of execution, producing similarly large concurrent data accesses, are ...
متن کامل